75 research outputs found
Keyframe-based monocular SLAM: design, survey, and future directions
Extensive research in the field of monocular SLAM for the past fifteen years
has yielded workable systems that found their way into various applications in
robotics and augmented reality. Although filter-based monocular SLAM systems
were common at some time, the more efficient keyframe-based solutions are
becoming the de facto methodology for building a monocular SLAM system. The
objective of this paper is threefold: first, the paper serves as a guideline
for people seeking to design their own monocular SLAM according to specific
environmental constraints. Second, it presents a survey that covers the various
keyframe-based monocular SLAM systems in the literature, detailing the
components of their implementation, and critically assessing the specific
strategies made in each proposed solution. Third, the paper provides insight
into the direction of future research in this field, to address the major
limitations still facing monocular SLAM; namely, in the issues of illumination
changes, initialization, highly dynamic motion, poorly textured scenes,
repetitive textures, map maintenance, and failure recovery
A Unified Hybrid Formulation for Visual SLAM
Visual Simultaneous Localization and Mapping (Visual SLAM (VSLAM)), is the process of estimating the six degrees of freedom ego-motion of a camera, from its video feed, while simultaneously constructing a 3D model of the observed environment. Extensive research in the field for the past two decades has yielded real-time and efficient algorithms for VSLAM, allowing various interesting applications in augmented reality, cultural heritage, robotics and the automotive industry, to name a few. The underlying formula behind VSLAM is a mixture of image processing, geometry, graph theory, optimization and machine learning; the theoretical and practical development of these building blocks led to a wide variety of algorithms, each leveraging different assumptions to achieve superiority under the presumed conditions of operation. An exhaustive survey on the topic outlined seven main components in a generic VSLAM pipeline, namely: the matching paradigm, visual initialization, data association, pose estimation, topological/metric map generation, optimization, and global localization. Before claiming VSLAM a solved problem, numerous challenging subjects pertaining to robustness in each of the aforementioned components have to be addressed; namely: resilience to a wide variety of scenes (poorly textured or self repeating scenarios), resilience to dynamic changes (moving objects), and scalability for long-term operation (computational resources awareness and management). Furthermore, current state-of-the art VSLAM pipelines are tailored towards static, basic point cloud reconstructions, an impediment to perception applications such as path planning, obstacle avoidance and object tracking. To address these limitations, this work proposes a hybrid scene representation, where different sources of information extracted solely from the video feed are fused in a hybrid VSLAM system. The proposed pipeline allows for seamless integration of data from pixel-based intensity measurements and geometric entities to produce and make use of a coherent scene representation. The goal is threefold: 1) Increase camera tracking accuracy under challenging motions, 2) improve robustness to challenging poorly textured environments and varying illumination conditions, and 3) ensure scalability and long-term operation by efficiently maintaining a global reusable map representation
Compact resettable counters through causal stability
Conflict-free Data Types (CRDTs) were designed to automatically resolve conflicts in eventually consistent systems. Different CRDTs were designed in both operation-based and state-based flavors such as Counters, Sets, Registers, Maps, etc. In a previous paper [2], Baquero et al. presented the problem with embedded CRDT counters and a solution, covering state-based counters that can be embedded in maps, but needing an ad-hoc extension to the standard counter API. Here, we present a resettable operation-based counter design, with the standard simple API and small state, through a causalstability- based state compaction.Project "Coral - Sustainable Ocean Exploitation: Tools and Sensors/NORTE-01-0145-FEDER-000036" is financed by the North Portugal Regional Operational Programme (NORTE 2020), under the PORTUGAL 2020 Partnership Agreement, and through the European Regional Development Fund (ERDF).
The research leading to these results has received funding from the European Union’s Horizon 2020 - The EU Framework Programme for Research and Innovation 2014-2020, under grant agreement No. 732505, project LightKone.
Project "TEC4Growth - Pervasive Intelligence, Enhancers and Proofs of Concept with Industrial Impact/NORTE-01-0145-FEDER-000020" is financed by the North Portugal Regional Operational Programme (NORTE 2020), under the PORTUGAL 2020 Partnership. Agreement, and through the European Regional Development Fund (ERDF)
H-SLAM: Hybrid Direct-Indirect Visual SLAM
The recent success of hybrid methods in monocular odometry has led to many
attempts to generalize the performance gains to hybrid monocular SLAM. However,
most attempts fall short in several respects, with the most prominent issue
being the need for two different map representations (local and global maps),
with each requiring different, computationally expensive, and often redundant
processes to maintain. Moreover, these maps tend to drift with respect to each
other, resulting in contradicting pose and scene estimates, and leading to
catastrophic failure. In this paper, we propose a novel approach that makes use
of descriptor sharing to generate a single inverse depth scene representation.
This representation can be used locally, queried globally to perform loop
closure, and has the ability to re-activate previously observed map points
after redundant points are marginalized from the local map, eliminating the
need for separate and redundant map maintenance processes. The maps generated
by our method exhibit no drift between each other, and can be computed at a
fraction of the computational cost and memory footprint required by other
monocular SLAM systems. Despite the reduced resource requirements, the proposed
approach maintains its robustness and accuracy, delivering performance
comparable to state-of-the-art SLAM methods (e.g., LDSO, ORB-SLAM3) on the
majority of sequences from well-known datasets like EuRoC, KITTI, and TUM VI.
The source code is available at: https://github.com/AUBVRL/fslam_ros_docker
Integration challenges of pure operation-based CRDTs in redis
Pure operation-based (op-based) Conflict-free Replicated Data Types (CRDTs) are generic and very efficient as they allow for compact solutions in both sent messages and state size. Although the pure op-based model looks promising, it is still not fully understood in terms of practical implementation. In this paper, we explain the challenges faced in implementing pure op-based CRDTs in a real system: the well-known in-memory cache key-value store Redis. Our purpose of choosing Redis is to implement a multi-master replication feature, which the current system lacks. The experience demonstrates that pure op-based CRDTs can be implemented in existing systems with minor changes in the original API.European Union Seventh Framework Program (FP7/2007-2013) under grant agreement 609551, SyncFree project.
Project “TEC4Growth - Pervasive Intelligence, Enhancers and Proofs of Concept with Industrial Impact/NORTE-01-0145-FEDER-000020”is financed by the North Portugal Regional Operational Programme (NORTE 2020), under the PORTUGAL 2020 Partnership Agreement, and through the European Regional Development Fund (ERDF).info:eu-repo/semantics/publishedVersio
Structure modifications of hydrolytically degradable polymer flocculant for improved water recovery from mature fine tailings
Oil sands mining operations in Canada produce large volumes of waste tailings that are difficult to dewater using commercial polyacrylamide-based flocculants. Recently, we have developed a novel hydrolytically-degradable polymer synthesized through micellar radical polymerization of short-chain polyester cationic macromonomers. Poly(PCL2ChMA), made of polycaprolactone choline iodide ester methacrylate with two polyester units, effectively treated mature fine tailings (MFT) solutions as evaluated by measuring initial settling rate, supernatant turbidity, and capillary suction time (CST) of the sediments[1].
Please download the file below for full content
Spatial Detection of Vehicles in Images using Convolutional Neural Networks and Stereo Matching
Convolutional Neural Networks combined with a state of the artstereo-matching method are used to find and estimate the 3D positionof vehicles in pairs of stereo images. Pixel positions of vehiclesare first estimated separately in pairs of stereo images usinga Convolutional Neural Network for regression. These coordinatesare then combined with a state-of-art stereo-matching method todetermine the depth, and thus the 3D location, of the vehicles. Weshow in this paper that cars can be detected with a combined accuracyof approximately 90% with a tolerated radius error of 5%,and a Mean Absolute Error of 5.25m on depth estimation for carsup to 50m away
IR Shape From Shading Enhanced RGBD for 3D Scanning
RGBD Cameras such as the Microsoft Kinect that can quickly provideusable depth maps have become very affordable, and thusvery popular and abundant in recent years. Beyond gaming, RGBDcameras can have numerous applications, including their use in affordable3D scanners. These cameras however are limited in theirability to capture finer details. We explore the use of additional3D reconstruction algorithms to enhance the depth maps producedfrom RGBD cameras, allowing them to capture more detail
- …